Noah D. Goodman

Endless Terminals: Scaling RL Environments for Terminal Agents

Jan 27, 2026

Learning to Simulate Human Dialogue

Jan 07, 2026

Scaling up the think-aloud method

May 29, 2025

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Mar 03, 2025

Non-literal Understanding of Number Words by Language Models

Feb 10, 2025

Emergent Symbol-like Number Variables in Artificial Neural Networks

Jan 10, 2025

BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery

Jan 02, 2025

CriticAL: Critic Automation with Language Models

Nov 10, 2024

Bayesian scaling laws for in-context learning

Oct 21, 2024

Human-like Affective Cognition in Foundation Models

Sep 19, 2024